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ABSTRACT 


This  paper  introduces  a  class  of  finite -memory  determin- 
istic algorithms  for  the  following  problem  of  hypotheses  test- 
ing under  a  finite  memory  constraint.   Let  X1,X-,X_,...  be  a 
sequence  of  independent,  identically  distributed  Bernoulli 
random  variables  where  X.  can  take  on  values  H  or  T.   The 
problem  is  to  decide  between  the  two  simple  hypothesis 
H    :  P(Xi=H)  =  p  vs.  T    :  P(X.=H)  =  q,  where  P(#  is  true)  =  tt 
=  P(T  is  true)  =  tt..  =  h.      The  X.'s  are  observed  sequentially 
and  a  new  decision  must  be  formulated  after  each  observation. 
Let  the  data  be  summarized  after  each  new  observation  by  a 
2n-valued  statistic  VeS  =  {1,  ,  2,  ,  .  .  .  ,n,  ,n  ,  (n-  1)  , .  .  .  ,  2  , 1  }  , 
which  is  updated  according  to  the  rule  V\  =  f  (V,..  ,X,)  ,  where 
f  :  S  X  (H,T)-*S;  is  the  transition  function.   Let  the  decision 
rule  take  action     [  H   is  true  if  V,e  (1,  ,  2,  ,  .  .  .  ,n,  ) 

d(vk)  J 

[r  is  true  if  Vke(nt,n-lt,. .. ,2t,lt) , 
at  time  k.   The  objective  is  to  find  the  function  f  which 
minimizes  the  probability  of  error  P(e)  =  tt   P(d  =  T\H   is 
true)  +  tt   P(d  =  H\T   is  true). 

The  algorithm  may  be  taught  of  as  a  finite  state  automaton, 
in  which  the  inputs  are  the  observations,  the  outputs  are  the 
decisions,  and  the  states  constitute  the  memory.   In  this  paper, 
the  optimal  algorithms  are  found  for  a  small  number  of  states 
(up  to  20)  . 


TABLE  OF  CONTENTS 

I.  INTRODUCTION 9 

II .  DESCRIPTION  OF  THE  ALGORITHM 21 

III.  DETERMINATION  OF  ERROR  PROBABILITY 25 

IV.  DETERMINATION  OF  (A  /B  )*  and  f*  FOR  SMALL  n 38 

APPENDIX  A 47 

APPENDIX  B. 73 

APPENDIX  C 76 

LIST  OF  REFERENCES 78 

INITIAL  DISTRIBUTION  LIST 79 


LIST  OF  TABLES 


I.  Optimal  Algorithm  f*  =  <  r(2),  ...  ,r(n)  >,r(i)  = 
State  that  the  Transition  Goes  to  From  i 42 

II.  Optimal  Algorithm  f*  =  <  p(2),  ...  ,p(n)  >,p(i)  = 
Number  of  States  that  the  Transition  Transits  from 
State  i 43 


LIST  OF  FIGURES 

1.  The  Transition  Function,  f 17 

2.  An  Algorithm  £  =   <p(l),  ...  ,p(n)>  19 

3.  An  Algorithm  f  =   <r(2),  ...  ,r(n)>  23 

4.  The  Transition  Matrix  for  the  Case  Where  n  =  4 

and  f  =  <  1,1,3  > 24 

5.  Natural  Logarithm  of  (A  /B  )*  versus  n 44 

6.  Natural  Logarithm  of  (A  /B  )*  versus  p 45 

7.  Natural  Logarithm  of  (A  /B  )*  versus  p 46 


ACKNOWLEDGEMENT 

The  invaluable  advice  and  assistance  of  Professor  Bruno 
Shubert  is  gratefully  acknowledged. 


I.   INTRODUCTION 

In  the  near  future,  computers  will  play  even  more  im- 
portant roles  in  every  day  life  than  ever.   We  are  expecting 
more  compact  machines  with  more  working  capabilities  and 
requiring  less  human  intervention.   The  idea  of  designing 
a  machine  with  decision  making  capability  is  of  great  inter- 
est.  If  such  a  machine  were  to  be  employed  in  a  decision 
process,  then  we  face  the  problem  of  trading  off  the  error 
caused  by  the  machine  operation  against  its  complexity.   In 
a  smaller  size  machine  with  limited  core  memory,  the  error 
caused  by  the  decision  under  finite  memory  is  even  more 
significant.   The  attempt  of  designing  a  small  machine  with 
limited  core  size  which  operates  with  least  probability  of 
error  is  worth  of  time  and  effort,  especially  if  such  a 
device  is  to  be  used  for  some  special  tasks.   Consider  an 
exploration  of  a  distant  star  with  an  unmaned  spacecraft. 
If  a  computer  with  a  decision  making  capability  were  to  be 
used  within  the  spacecraft,  it  would  almost  certainly  have 
to  be  small  and  of  limited  core  size.   This  kind  of  machine 
or  automaton  would  be  required  to  make  decision  with  minimum 
probability  of  error  constrained  by  the  available  memory. 
In  this  example,  the  machine  or  automaton  acts  like  an 
input-output  machine  which  has  the  observed  data  as  the  in- 
put.  The  output  of  the  machine  are  the  decisions.   Only  a 
finite  amount  of  informations  can  be  stored  at  any  time.   To 


construct  such  an  automaton,  a  form  of  a  decision  process 
could  probably  be  adapted  from  the  statistical  test  of 
hypotheses . 

Let  X..,X_,...  be  a  sequence  of  independent,  identically 
distributed  random  observations  drawn  according  to  a  proba- 
bility measure  T   defined  on  arbitrary  probability  space. 
Consider  the  simple  hypotheses  testing  problem 

HL:  T  =  T  vs.    H, :  T   =  2\  . 

0       o  11 

Let  the  prior  probabilities  of  the  null  and  alternative 
hypotheses  be  denoted  by  it   and  tt-  respectively.   The  goal 
is  to  find  a  sequence  of  decision  rules  d..  (X.. )  ,  d~  (X..  ,X~)  ,  .  .  , 
which  minimize  the  asymptotic  probability  of  error 


where 


P(e)  -  e[  lim   i   .8,  el 

v  J  |  n->°°  n   i=l   ll 

f  1  if  di  /  H  true, 
e .  =  < 

1    [  0  if  d.  =  H  true. 


(1) 


Denoting,  for  a  sample  of  size  n,  a   the  probability  of 
type  I  error  and  3   the  probability  of  type  II  error  it  is 
well  known  that  a   and  £  will  exponentially  approach  zero 
as  n  becomes  large.   However,  the  decision  d   depends  on 
(X1 ,X? , . . . ,X  ),  so  that,  as  n  increases,  the  amount  of  data 
to  be  stored  increases  without  bounds.   Some  means  of  data 
reduction  may  therefore  be  desirable.   Sufficient  statistics 
can  sometimes  be  used  to  reduce  the  required  size  of  memory. 
These  statistics  lose  no  information  when  used.   Unfortunately, 
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the  following  example  shows  that  this  type  of  data  reduction 
is  sometimes  misleading. 

Let  X  "*  N(u,l)  with  the  hypotheses 


where 


HQ:  y  =  1 


%  =  ul  =  h' 


vs 


H1:  u  =  -1 


A  sufficient  statistic  for  this  test  is  Y 


n 


n 
1=1   l ' 


where  Y   is  the  value  of  the  statistic  after  n  observations 
n 

A  simple  optimal  decision  scheme  is 


d   = 

n 


{     Hn   if  Y   >  0 
0      n  - 


H,   if  Y   <  0. 
1      n 


The  updating  rule  for  Y   is  given  by 

Y  xi  =  Y   +  x  xi • 
n+1    n    n+1 

Thus  Y   contains  all  the  desired  informations  about 
n 

fX1,X_,...,X  )  ,  and  only  Y  needs  to  be  recorded  at  the  time 
*■  1'  2 '    '  nJ  '  }      n 

n.   However,  Y   is  real-valued  so  that  a  potentially  infinite 
storage  is  needed  for  it  alone.   If  the  memory  can  store  a 
real  number  then  it  can  store  any  number  of  real  numbers  so 
that  no  saving  is  actually  achieved.   The  attempt  of  round- 
ing off  the  sufficient  statistic  to  some  finite  number  of 
digits  need  not  be  optimal.   In  fact,  Cover  [2]  has  shown 
that  the  error  probability  need  not  tend  to  zero  if  rounded 
off  statistic  is  used.   If  constrained  by  finite  memory, 
some  other  statistic  must  be  devised  in  cooperation  with 
some  appropriate  decision  rule.   Before  further  discussion 
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of  this  topic,  it  would  be  worthwhile  to  look  at  the  liter- 
ature survey  concerning  a  decision  model  constrained  by 
finite  memory.   The  first  mention  of  finite  memory  in  con- 
nection with  a  statistical  decision  problem  is  to  be  found 
in  [1].   This  paper  by  Robbins  discussed  the  problem  of 
choosing  one  of  two  ways  of  action,  each  of  which  may  lead 
to  success  or  failure,  in  such  a  way  as  to  maximize  the 
long-run  proportion  of  successes  obtained.   The  choice  each 
time  is  allowed  to  depend  only  on  a  finite  number  of  past 
observation.   Suppose  we  have  two  coins  and  we  wish  to 
maximize  the  number  of  heads  thrown  during  a  sequence  of 
tosses.   If  we  had  prior  knowledge  of  which  coin  has  the 
larger  probability  of  turning  up  a  head,  we  should  use  it 
exclusively,  irrespective  of  the  outcomes  of  previous  tosses 
Then,  with  probability  1 

lim  number  of  heads  in  first  n  tosses        ,  . 

=  max  (p.,  ,PoJ  , 

n-*-°°  n  \.fi>f2J> 

where 

p.  =  probability  of  head  for  the  i —  coin. 

If  no  information  about  p1  and  p?  is  available,  then 
there  still  is  some  decision  rule  which  asymptotically 
achieves  the  above  limit.   Robbins  had  come  up  with  the  fol- 
lowing decision  rule,  said  to  be  of  type  r  if  the  decision 
as  to  which  coin  will  be  used  for  the  n —  toss  depends  only 
on  the  results  of  tosses  n-r ,n-r  +  l , . . . ,n- 1 .   "Define  the 
rule  R,  as  follows:   start  tossing  with  coin  1,  stop  if  the 
first  toss  is  a  tail,  otherwise  continue  tossing  until  the 
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first  run  of  r  consecutive  tails  occurs  and  then  stop.   This 
defines  the  first  block  of  tosses  with  coin  1.   Now  start 
tossing  with  coin  2  and  apply  the  same  rule,  obtaining  the 
first  block  of  tosses  with  coin  2.   Then  start  again  with 
coin  1  and  apply  the  same  rule,  obtaining  the  second  block 
of  tosses  with  coin  1  and  so  on  indefinitely,  thus  generating 
an  infinite  sequence  of  tosses  consisting  of  alternate  blocks 
of  tosses  with  coin  1  and  2."   With  rule  R  ,  of  type  r  so 
defined  Robbins  proved  that  with  probability  1 

lim  number  of  heads  in  first  n  tosses  .  ^1^   *z     *z    ^1 


(1-Pl)r  *    d-P2)r 
and  note  that 

lim  PtCI-P^1"  *   P2  Cl-Pl)r 

n~  7j oF— ~ TF-  =  max  CPi.Pa). 

(l-p1)  +  (l-p2) 

So  the  rule  R   is  the  best  among  the  other  of  type  r 
which  maximize  the  long-run  proportion  of  heads  obtained. 

Although  Robbins  in  his  paper  did  not  consider  hypotheses 
testing  problem,  the  paper  nevertheless  stimulated  the  idea 
of  using  a  finite  memory  in  statistical  decision.   By  using 
the  same  m-finite  past  memory  as  in  [1],  Cover  [2]  developed 
a  decision  rule  using  a  4  states  memory  alogrithm  for  the 
hypotheses  testing  problem  as  applied  to  Bernoulli  trials. 
The  details  of  this  can  be  found  in  [2] . 

The  first  actual  finite  state  memory  algorithm  was  pro- 
posed in  [3]  and  [4]  by  Hellman  and  Cover.   Here  the  past 
observations  were  stored  as  a  state  of  a  machine  constrained 
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to  have  only  m  states.   Consider  the  two  hypotheses  testing 
problem  as  stated  earlier,  where 

Hn:  T   =  T  vs      H  •  T   =  T . 

0       o  1       1 

and  the  prior  probabilities  P(Hn  is  true)  =  tt   and  P(H..  is 

true)  =  tt-. 

After  each  observation,  let  the  data  be  summarized  by 

an  m-valued  statistic  Ve { 1 , 2 , . . . ,m}  where  the  updating 

scheme  is  Vn  =  f(Vn_1,  Xn)  ;  f :  {1 ,2  , .  .  .  ,m}XW->{l  ,2  , .  .  .  ,m} 

where  X.eW  the  set  o£  values  which  X.  can  take  on  for 
l  i 

i  =  1,2,...  and  f  is  the  transition  function. 

Let  the  decision  rule  be  defined  as 

dn  =  d(Vn);  d  :  {1,2,.  .  .  ,m}  ■+   {HQ,H1}. 

The  pair  (f,d)  then  describes  a  finite-state  automaton  with 

inputs  X   and  outputs  d   =  d (V  )  and  state  space  S  =  {1,2, 

. . . ,m} .   Here  V   is  the  state  at  time  n.   Under  hypotheses 
n 

Hn  or  H1 ,  the  sequence  V  ,  with  some  specific  initial  state, 
forms  a  Markov  chain  over  the  state  space  S.   In  order  to 
minimize 


P(e)  =  E 


f11™  i  .!,  el 
I  n->°°  n  i  =  l   l  J 


where  e.  is  the  error  as  defined  in  (1),  we  need  to  find  the 

optimal  pair  (f,d).   Hellman  and  Cover  have  established  a 

lower  bound  for  P(e)  as  follows. 

Let  f„  and  f..  be  the  probability  densities  of  the  sample 

under  the  respective  hypotheses  with  respect  to  a  dominating 

f0(x) 
measure.   Define  the  likelihood  ratio  to  be  £(x)  =  -r — r— y  . 
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Let  I   denote  the  essential  supremum  of  £(x)  and  A  the  es- 
sential infimum  o£  I (x)  where  the  supremum  and  infimum  are 
taken  over  all  measurable  sets  with  positive  dominating 
measure.   Define  y  =  l/l_.      Then  for  an  irreducible   in- 
state automaton,  we  have  P(e)  >  P*  where 


2/tt  ir-Y     "•  1 

Px  =  mm  <  = >  tt  ,tt1  i  . 

ym-l  _  1     o'  1 

It  was  further  proved  in  [3]  that  the  same  bound  P*  also 

hold  for  P(e)  in  a  reducible  (m+1) -state  automaton.   For  a 

special  case  where  tt   =  ir1  =  %,  the  P*  for  the  irreducible 
m-state  automaton  becomes 


If  the  m-state  automaton  is  reducible,  the  bound  becomes 

P*  =      1 

YJsCm-2)+  -  •   • 

The  ratio  y  =  ^/A  ^s  a  measure  of  the  separation  between 
the  two  hypotheses.   Notice  that  P*  decreases  exponentially 
with  m. 

However,  Hellman  and  Cover  have  shown  that,  except  for 
degenerate  cases,  no  machine  can  actually  achieve  the  bound 
P* .   But  as  an  example  in  Bernoulli  case,  an  e-optimal  class 
of  automata  (i.e.,  such  that  for  any  e  >  0  there  exist  an 
automaton  with  P(e)  <  P*  +  e)  was  introduced.   The  detailed 


We  call  the  automaton  irreducible  (reducible)  if  the 
result  Markov  chain  formed  by  sequence  {V  }  under  both 
hypotheses  is  irreducible  (reducible).    n 
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description  of  this  class  of  automata  is  as  follows:   Let 
X1 ,X_  , . . .  be  a  sequence  of  independent,  identically  dis- 
tributed Bernoulli  random  variables  where  X.  can  take  on 

1 

value  H  or  T,  let  the  two  hypotheses  be 

11    :  P(X.=H)  =  ph      vs.    T    :  P(X.=T)  =  pt 

where  the  prior  probabilities  are  it   =  ir-  =  h   and  q.  =  1-p- 
f or  i  =  h,t . 

Without  loss  of  generality  it  may  assume  that  p,  >  p  , 

in  which  case,  I  =  Ph/Pt ;  1  ■  %^t   and  Y  =  ^-  =  phqt/'ptqh' 
If  the  hypotheses  are  symmetric,  that  is  if  p,  =  1  -  p  , 
then  Y  =  (Ph/qh)2;  hence  P(e)  >  l/(l  +  (ph/qh)m"1)  for  an 
irreducible  m- state  automaton,  and  P(e)  >  l/l  +  (Pu/^u)    ) 
for  a  reducible  m-state  automaton. 

Let  the  transition  function  f  be  defined  as  follows  (see 
Figure  1) : 

(i+1    if  X  =  H 
i-1    if  X  =  T 

for  i  =  2,3,... ,m-l 

12      with  probability  6  >  0  if  X  =  H, 
1      otherwise, 

1m-l    with  probability  kfi  >  0  if  X  -  T 
m      otherwise, 

where  k  =  1  for  the  case  of  symmetric  hypotheses. 
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6H 


1        2       3 
T      T 


Figure  1 


The  transitions  are  made  to  the  adj acent , states  only 
when  the  event  X  =  H  or  X  =  T  are  observed.   Otherwise,  the 
state  does  not  change.   This  automaton  will  reach  an  end 
state  only  on  strong  evidence  to  support  the  corresponding 
hypotheses  (i.e.,  to  enter  state  m,  preference  on  H   is 
strongly  and  vice  versa  for  T) .   However,  leaving  the  end 
states  has  only  a  very  small  chance  when  6  is  small.   A 
decision  made  at  the  end  states  results  in  the  smallest 
probability  of  error  and  as  6  ■*  0 ,  the  error  probability 
P(e)  should  asymptotically  approach  P* . 

The  Hellman-Cover  algorithm  is  useful  in  providing 
sequences  of  decisions,  but  not  very  suitable  for  the  case 
when  only  a  single  decision  is  required.   The  irreducible 
automaton  will  asymptotically  approach  the  lower  bound  for 
P(e)  after  a 'large  enough"  number  of  observations  and  there 
is  no  way  to  tell  how  large  the  number  should  be.   In  case 
when  we  need  a  more  reliable  decision  we  may  have  to  wait 
for  a  long  time.   In  addition,  the  automaton  requires  arti- 
ficial randomization  for  generating  the  probabilities  6  and 
k6  to  transit  out  of  the  end  states.   Such  a  random  generator 
needs  additional  memory  to  be  added  to  the  automaton  and  it 
will  no  longer  be  a  finite  state  automaton  if  we  need  a  very 

small  6  as  close  to  zero. 
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On  the  same  problem  of  the  Bernoulli  case,  viewed  as 
two-ways  Bernoulli  classification  problem,  Shubert  had 
introduced  in  his  paper  [8]  a  deterministic  machine  which 
can  perform  as  well  as  optimal  randomized  machines  only  if 
the  machine  memory  is  increased  by  less  than  one  bit.   This 
class  of  algorithm  use  the  data  source  itself  to  provide 
the  necessary  randomization.   The  problem  of  finding  a 
truly  deterministic  algorithm  has  been  considered  by  Ander- 
son in  his  thesis  [5].   He  developed  a  special  class  of 
symmetric  (2n+3) -state  algorithms  with  two  absorbing 
states.   The  algorithm  can  perform  a  decision  process  with- 
out randomization. 

He  defined  the  transition  function  f  as  follows  (see 
Figure  2) : 

f(s,H)  =  s+1  ,    f(s,T)  =  s-p(s)  if  s  =  l,2,...,n 

f(s,T)  =  s-1  ,    f(s,H)  =  s  +  p(s)  if  s  =-1,-2,. . .  ,-n 

f(s,H)  =  1    ,    f(s,T)  =  -1  if  s  =  0 

f(s,H)  =  s    ,    f(s,T)  =  s  if  s  =  ±  (n+1) 
where  1  <  p(s)  <  s. 

This  algorithm,  however,  has  higher  probability  of 
error  than  the  randomized  machine.   But  it  was  shown  in  [5] 
that,  if  the  randomization  is  provided  for  this  determinis- 
tic machine,  the  Hellman-Cover  lower  bound  for  P(e)  can  be 
approached  arbitrarily  closely. 

In  this  thesis,  another  class  of  the  deterministic 
algorithm  is  introduced  and  investigated.   The  class  is  of 
ergodic  type  and  it  is  the  only  characteristic  in  which  it 
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d=H 


n+1 


H 


s-p(s) 


START 

m 


H 


-1 


H 


s  +  p(s) 


H 


d=T 


n 


-(n+1) 


Figure  2.   An  Algorithm  £  =  <p(l),  ...  ,  p(n)  > 
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differs  from  the  one  discussed  by  Anderson  [5].   Since  there 
were  a  lot  of  similarities  in  this  paper  and  Anderson's 
paper,  the  sections  concerning  the  proof  of  achieving  the 
bound  with  randomization  and  deriving  the  asymptotic  bound 
for  P(e)  are  omitted.   However,  the  determination  of  the 
probability  of  error  in  terms  of  the  algorithm  is  presented 
in  the  complete  form.   The  search  of  optimal  algorithms  for 
the  cases  of  small  number  of  states  have  been  done  by  alge- 
braic computation  in  some  cases  and  by  computer  search  for 
the  other.   The  results  are  summarized  in  Table  I  and  Table 
II  with  Figures  5,  6  and  7  to  provide  clearer  idea  of  the 
trend  for  larger  number  of  states. 
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II.   DESCRIPTION  OF  THE  ALGORITHM 

■Let  X- ,X? ,X, , . . .  be  a  sequence  of  independent  identically 
distributed  Bernoulli  random  variables  where 

X.eG  =  {X  :  X  =  H  or  T}   for  all  i  =  1,2,3,... 
Consider  the  simple  hypothesis  testing  problem 

B    :  P(X.  =  H)  =  p   vs.    T    :  P(X.  =  H)  =  q, 
where 

%  <  p  <  1  and  q  =  1  -  p. 

Denote  the  prior  probability  of  H   being  true  as  ir,  and 
denote  tt  ,  the  prior  probability  of  T   being  true.   Here  only 
the  case  it,  =  it,  =  h ,  the  symmetric  hypotheses  testing  prob- 
lem will  be  considered.   The  sequence  of  random  variables 
X,  ,X2  ,X., , .  .  .  can  be  viewed  as  successive  tosses  of  a  coin 
which  is  biased  towards  heads  under  hypothesis  H   or  biased 
towards  tails  under  hypothesis  T. 

Let  r  =  {r(2),r(3),...,r(n)}  be  a  sequence  of  positive 
integers  such  that  1  <  r(i)  <  i  for  i  =  2,3,...,n;  where  n 
is  any  positive  integer. 

With  each  r_  we  associate  a  finite-memory  symmetric 
algorithm  (M,f,d)  (See  Figure  3),  where  M  is  defined  to  be 
a  set  of  2n  states  such  that  M  =  {  (1  ,h) , (2 ,h) , . . . , (n ,h) , (n , t) , 
. . . , (2 , t) , (1 , t) } .   The  subscripts  h  and  t  indicate  the  states 
in  which  the  decision  in  favor  of  the  hypothesis  H   and  T 
respectively  is  made.   In  other  words,  the  decision  rule  d 
is  d{(i,h)}  =  H   and  d{(i,t)}  =  T    for  i  =  1,2,. ..,n. 
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The  transition  function,  £  is  defined  by 


f{(i,h),H}  =  (r(i),h) 


£{(i,t),T}  =  (r(i),t) 
f{(i,h),T}  =  ((i+l),h) 


£{(i,t),H}  =  ((i+l),t) 


£{(l,h),H>    = 

(l,h) 

f{(l,t),T}    = 

d,t) 

£{(n,h),T}    = 

(n,t) 

f{(n,t) ,H}    = 

(n,h) 

if  i  =  2,3, ... ,n 


if  i  =  1,2 ,. . . ,n 


With  the  sequence  X..,X_,X_,...  of  independent  identically 
distributed  random  variables  as  the  input,  the  states  of  the 
algorithm  (M,f,d)  form  an  ergodic  Markov  chain.   The  transi- 
tion probabilities  are 

P{(i,h)  —  (r(i),h)}  =  p  \ 

>   if  i  =  2  ,3, . . . ,n 

P{(i,t)  —  (r(i),t)}  =  q  ) 


P{(i,h)  —  (i+l,h)}   =  q 


P{(i,t)  -->  (i  +  l,t)}   =  p 


if  i  =  1,2,...  ,n- 1 


P{(l,h)  -  (l,h)} 
P{(l,t)  -->  (l,t)} 
P{(n,h)  ~*  (n,t)} 
P{(n,t)  -  (n,h)} 
under  the  hypothesis  H 


=   P 

■  q 

■  q 

■  p 

Under  the  hypothesis  T,    the  transition 
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H 


(l,h) 


(2,h) 


r(i),h) 


(i,h) 


(r(i),t) 


Figure  3.   The  Algorithm  f  =  <  r (2) ,r (3) , . . . , r (n)  >;  where 
(r(i)jj)  is  the  state  that  the  transition  transits  from 
(i>j)  >j    =   h,t. 
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probabilities  have  the  same  form  as  above  with  p  and  q  are 
interchanged  in  places. 

From  now  on,  the  specific  form  of  the  algorithm  (M,f,d) 
will  be  denoted  byf   =<r>=<  r (2) ,r (3) , . . . ,r (n)  >. 
Figure  4  shows  the  algorithm  and  the  transition  matrix  of 


Transition  Diagram  for  f.  =  <  1,1,3  > 
(l,h)   (2,h)   (3,h)   (4,h)  (4,t)   (: 


(l,h) 

P 

(2,h) 

P 

(3,h) 

P 

(4,h) 

0 

(4,t) 

0 

(3,t) 

0 

(2,t) 

0 

(l,t) 

0 

Figure   4. 
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Illustration  of  Algorithm  Diagram  and  the  Transition 
Matrix  for  the  Case  of  n  =  4  and  f . (r (2) ,r (3) ,r (4) ) 
=  <  1,1,3  >. 
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III.   DETERMINATION  OF  ERROR  PROBABILITY 

Let  Sh  =  {(l,h),(2,h),...,(n,h)},  let  &t  =  { (n, t) , (n-1 , t) , 
. . . , (2 ,t) , (1, t) }  be  the  two  subsets  of  the  set  of  states  M. 
Define  V(s),  se  SAJ  S      be  its  stationary  probabilities  and 
let 

^ShD  -  SIS  VCS)  and  u(St)  =  s|s  Hs) 

n  t 

be  stationary  probabilities  of  the  state  being  in  S,  and  S 
respectively.   With  the  decision  rule  d,  the  probability  of 
error  can  be  written  as 

P(e)  =  it,  P(d  =  T\H)    +  tt   P(d  =  H\'T) 

=   h   P(seSt|lO  +  h   P(seSh|r). 

By  the  symmetry  of  the  algorithm  we  have 
P(seS  \H)    =  P(seSh|T)  that  is 

PCe)  =  P(seSt|ff) 

P(seSt|ff) 


-1 


P(seS  |fl)  +  P(seSh|ff) 

I        PCsEsh[g3    y1         i        ucshj  ) 
=    Y      p(SEst|H)    J        =    y      VJKJ) 

In  order  to  obtain  explicit  expression  for  the  probability 
of  error  in  term  of  the  algorithm  f  ,  we  now  prove  the  fol- 
lowing proprosition. 


Proprosition  1:   Let  f   =  <  r (2) , r (3) , . . . , r (n)  >  be  given, 


Then, 
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where  A  and  B   are  polynomials  in  q  and  p  respectively 


satisfying  the  recurrence  relations: 

n     n 


n+1    n    r£=r(n+lj   *n 


-£ 


,  A-  =  1    (a) 


B 


n 


n+1  =  Pn  +  Mr(n+1)  B*p 


n 


-I 


,    B-  =  1    (b) 


Hence,  both  A   and  B   have  integral  coefficient  and  are  of 
n      n  ° 

degree  less  than  n. 

Proof  of  Proprosition  1 

Let  P  be  the  transition  matrix  for  the  chain  f  ,  where 

n 

first  n  rows  and  columns  correspond  to  states  (l,h),(2,h), 
...,(n,h)  and  the  following  n  rows  and  columns  to  states 
(n,t),(n-l,t) ,. .. ,(l,t). 

Let  y  =  (u(l,h) , . . . ,u  (n,h) ,y (n,t) , . . . ,y(l,t))  be  the 
stationary  distribution,  so  that 


V(I  "  P)  =  0  , 
where  I  is  the  identity  matrix, 
four  submatrices  in  the  form 


(1) 
Partition  the  matrix  P  into 


where  P,  is  an  nxn  matrix  of  the  form 
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ph  = 


p  q 

p   0   q 


All  O's 


0   q 


0   q 


column: 


1   2 


r(i)   i 


and  P   is  an  nxn  matrix  of  the  form 


0 
P 


Pt  = 


0    p 


column 


0 
0 
n 


P  0   q 

p   0  q 

p  q 

i   r(i)  2  1 


0 
0 


q 
o 

n 


row: 
1 
2 


r(i) 


n 


row: 
n 


r(i) 

2 
1 


According  to  the  sequence  r,  these  2  matrices  will  have 
all  entries  located  in  the  same  form  except  that  they  are 
opposite  to  each  other  in  both  position  and  p,q  values. 
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Notice  that  each  row  of  these  matrices  contains  exactly- 
one  entry  q  namely  the  (i,r(i)) —  one,  and  that  the  label- 
ling of  rows  and  columns  of  P   begins  at  the  lower  right 
corner  while  the  labelling  of  P,  begins  from  the  upper  left 
corner.   The  off-diagonal  matrices  Q,  and  Q   consist  of  all 
zeros  except  for  the  lower  left  corner  of  Q,  which  is  q, 
and  the  upper  right  corner  entry  of  Q  ,  which  is  p. 

With  this  partition,  equation  (1)  decomposes  in  to  two 
equations 

Uh(I  -  Ph)  =  (0,0,...,0,y(n,t)p)  (2) 

yt(I  -  Pt)  =  (y(n,h)q,0,...,0),  (3) 

where 

yh  =  (y(l,h)  , .  .  .  ,y(n,h)) 


1 


Vt  ■■  (n(n,t)  ,.  .  .  ,y(l,t)). 

From  (2)  we  have  y,  =  (0  ,0  , .  .  .  ,0  ,y  (n  ,  t)p)  (I  -  P,) 
Let 

aij  =  Ci,jD—  entry  of  (I  -  P^"1 

.  .    y(i,h)  =  y(n,t)pa   .     ,  i  =  1,2,. ..,n.  (4) 

Using  the  formula  for  matrix  inversion  we  get 


t1  -  Vn.n 

a.  ■  =  **-* — - 


"JVM 
I1  "  V 


where  |I  -  P.  |  is  the  determinant  of  (I  -  P,  )  and  (I  -  P,  )  ( .    .. 

h- 


is  the  (j,i) —  cofactor  of  (I    P,)  matrix.   Hence 
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u(sh)  -  .f1  y(i,h)  =  fl";^,     &  (I  -  Ph)(i,n).  (5) 

Let  A  be  the  determinant  of  the  nxn  matrix  obtained 
n 

from  (I  -  P,  )  after  replacing  the  n —  column  of  (I  -  P,  ) 
matrix  by  a  column  of  l's, 


A  = 
n 


q   -q 
P    1 


1 
1 


1  -q 


1  -q  1 


(6) 


Expanding  A   along  this  last  column  we  obtain 

n 

A  =  .J-  (I  -  P.  )  ,.   .  , 

n    i=l  v     hJ  (i ,n) ' 

since  the  (i,n) —  cofactors  of  A   and  (I  -  P,  )  are  identical 
thus  u(Sv)  in  (5)  can  be  rewritten  as 


urS  1  =  yCn,t)p  A 
yl-V    |  I  -  PJ  V 


Since  the  only  transition  between  S,  and  S   is  through 
state  (n,h)  and  (n,t)  we  must  have 

y(n,t)p  =  u(n,h)q  (7) 

in  the  stationary  regime. 
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Using  the  identity  in  (4)  we  get 


p(n,h)  =  u(n,t)p   [t  _V"   ^ 


and  substituting  in  (7)  we  get 


!  •   Pj  =  q(]  -  Ph) 

Vcn.;) 


h1    H^x   lhJ(n,n) 
Consider  (I  -  P,  )  ,_  _-,  which  has  the  form 


(I  ■  Vcn.n) 


q   -q 

•p   i  -q 


"P 


1   -q 


q 

i 


which  is  the  same  as  the  determinant  of  the  (n-l)x(n-l)  ma- 
trix I  -  P,  obtained  for  the  chain  f  _-.   Putting  temporarily 
the  superscript  (n)  for  the  number  of  states  in  S,  ,  we  have 
a  recurrence  relation 

1 1 00  _  p(n),    =  q)  jCn-1)  .  p(n-l)( 


Since 


|I 


we  obtain 


CD  .  p(l) 


(n)    D(n) 


|!UU  .  p 


n 
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Thus 


n 


u(SO  =  -£  y(n,t)p 


n 


Going  back  to  (3)  and  repeating  all  the  steps  found 

above  we  can  end  up  with  the  similar  expression 

B 
u(St)  =  -£  y(n,h)q 

P 

where  B   is  the  determinant  of  order  n 
n 


B 


n 


■p   1 


-p   1 


1   -q 
-p  -p 


B   is  the  determinant  of  the  matrix  (I  -  P„.)  after  we 
n  K  tJ 

replace  its  n —  column  (in  the  most  left  hand  side)  by  a 
column  of  l's. 

Hence,  using  (7)  again,  we  have 

y(S,)     n   A 

uTsry       n  B 

K    tJ         q     n 

and  it  left  to  prove  that  A   and  B   have  the  form  as  stated 

1  n      n 

in  (a)  and  (b) . 
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Consider  A   first.   To  evaluate  this  determinant  let 
n 

I1  =  {i  =  2,3,. ..,n:  r(i)  =  1}. 

Notice  that  I1  is  the  set  of  exactly  those  rows  indices  i 
for  which  the  (i,l) —  entries  in  (6)  are  -p.   Multiply  the 
first  row  in  (6)  by  p/q  and  add  it  to  all  rows  such  that 
iel...   This  operation  does  not  change  the  determinant  of 
the  matrix.   So  we  have 


A   = 
n 


q   -q 

0    q   -q 

.   -p    1 


"P 


'21 


1    -q 


(n) 


0  2n 
The  entries  in  the  last  column  are  given  by 

1  +  2-   if  iel 


(n) 
'21 


q   1 

1       if  itfl- 


Expanding  this  determinant  along  the  first  column  we 


have 


-  qDfn)  , 


,00 

2 


where 


d|  *      -   A   is  the  original  determinant  (6)  and 
In  b 
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D 


(n) 


q   -q 

p   i  -q 


t(n) 


"P 


.GO 

'2n 


This  determinant  is  of  order  n-1.   Notice  that  the  en- 
tries in  the  first  column  in  the  determinant  D~  '    are  -p 
only  for  row  indices  1  =  3,...,n  such  that  either  r(i)  =  1 
or  r(i)  =  2.   Hence,  letting  I~  =    {i  =  3,4,...,n:  r(i)  <  3}, 


mu 


(n) 


ltiplying  the  first  row  in  D^  J    by  p/q  and  adding  to  rows 


with  iel~  this  determinant  becomes 


q   -q 

0    q   -q 


t(n) 

32 


1   -q 


(n) 
3n 


The  entries  in  the  last  column  are 
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f  t(n)  +  a  t(n) 
2i     p   22 


(n)_ 
'3i 


if  iel, 


=  < 


(n) 
2i 


if  i£I. 


Expanding  again  along  the  first  column  we  have 
D'n)-  qD™  , 


where 


D 


(n)  _ 


q   -q 


1   -q 


(n) 
33 


1   -q 


'  t(n) 
r3n 


Proceeding  in  this  fashion  we  obtain  a  sequence  of  de- 
terminant Dn(n),  D0(n),...  ,D(n)  (8) 
1   '   2        n 

where  determinate  Iv.    is  of  order  n-k+1.   The  entries  tA,J 
in  the  last  column  of  dS    '    satisfy  the  recurrence  relation 


t(n) 
rk+l,i 


rki    p.  Tk,k 


(n) 
ki 


if  iel. 


if  UK    » 


GO 


1,2,. ..,n;  i  =  k,k+l,...,n;  and  t,  -  =  1,  where 

J- » i 


(9) 
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I.  =  {i  =  k+l,k+2,. . . ,n:   r(i)  <  k}. 


Further 


so  that 


uk+l   qUk   » 


r.(n)     n-l^fn)     n-l„.(n) 
1     n    n     n    n,n 


(10) 


Consider  the  determinant  A  .-  o£  order  n+1  obtained  by 

n+1  7 

using  the  same  sequence  r, 


n+1 


q   -q 

■p      i    -q 


1 
1 


1   -q 


1  1 

1 


Applying  the  above  procedure  to  DJ    J  =  A  +-  we  obtain 


a  sequence 


D(n+1) 


D(n+1) 
»  n+1 

(n+1) 


(11) 


where  the  determinants  D£    *    again  satisfy  (9)  with  n  re- 
placed by  n+1.   Arrange  now  the  last  columns  of  the  sequences 
(8)  and  (11)  into  triangular  arrays  as  follows: 
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(n) 


(n) 
1,1 


t(n)   t(n) 
^1,2'  ^2,2 


=  < 


t(n)   t(n) 
l,n'   2,n' 


•  ,t 


(n) 
n,n 


(n+1) 


tl,l 


(n+1)    (n+1) 
ll,2   '   2,2 


■  < 


(n+1)    (n+1) 

l,n   '  ^.n   »  *  *  *  'l 


(n+1)    (n+1) 
l,n+l'  L2,n+1' 


(n+1) 
n,n 


(n+1)    (n+1) 
'  Ln,n+1»   n+1, n+1 


Since  for  i  <  n  by  the  definition  of  sets  il  J  ,  ielf11^ 

-  r     J     i  •   r   •   r(n+l)    ^.i     r-  rr  r  (n)      i  ~(n+l) 

if  and  only  if  leli:     ,  the  first  n  rows  of  T v  J    and  T^    ' 
are  identical,  i.e., 


:k\i  =  ^i13   '  i  =  k---n;  k  =  1,2,. ...n.   (12) 


Next  by  (9) 


t(n+l) 
zk,n+l 


if  k  =  1,2, .. . ,r(n  +  l) 


k-1 


(n+1) 


1  +  E  z       tln  iJ 

1    q  £  =  r(n  +  l)   £,  £    if  k=r  (n+1)  +  1  ,  .  .  .  ,n+l 
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In  particular  for  k  =  n+1  since  r(n+l)  <  n+1 

rn+l,n+l    x        q  £=r(n+l)  1,1 

But  by    (12)    for   I   <   n+1,   we  have 

(n+1)        =      (n)    _  =      (*) 

^S.,1  "     *-&,£    "     '     '     '  LSL,l    ' 

Hence,  from  (13)  we  have 

t(n+l)    =  i  +  E  ?       t(*) 
n+1, n+1    x        q  A=r(n+1)  *£,£  ■ 

Hence,  by  (10) 

D(n+1)  D(£) 

1       -  1  +  ^  .?        ! 


(13) 


n  q  £  =  r(n+l)   £-1  ' 

q  H     <•      q 


Since  D>  J  =  A  ,  we  have 
1      n' 


.n  ,     "•       ,  _n-£ 


An+1  =  q  +  P  lfirCn+1)  AAq    ^=1,2,...   (14) 
where  A,  =  1. 

The  recurrence  relation  for  B   is  established  in  exactly 

n  ' 

the  same  fashion.   Notice  that  the  difference  of  determinants 
A   and  B   is  only  p  and  q  are  in  reverse  position,  thus  on 
the  same  sequence  of  r  we  have 

Bn+1  -  Pn  ♦  q  Jr(n+1)  B£pn"\  n  =  1,2,...   (15) 

From  (14)  and  (15)  we  had  completed  the  proof  of  propo- 
sition 1. 
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IV.   DETERMINATION  OF  OPTIMAL  ALGORITHM  FOR  SMALL  n 


Since  the  error  probability  is 
P(e)  =  <  1 


{■•Si 


in  order  to  minimize  P(e)  we  need  to  maximize  the  ratio 
y(Sh)/y(St). 

From  the  last  section,  we  have 


y(sh) 


n  A 


where 

h   <  p  <  1  and  q  =  1  -  p. 
Thus  the  ratio  y(S,)/y(S  )  is  maximized  if  A  /B   is  maxi- 
mized.  The  ratios  A  /B   depend  on  the  sequence  f  (rl  = 

n  n   r  M       n  - 

<  r (2) ,r (3) , . . . ,r (n)  >.   After  we  have  determined  all  pos- 
sible algorithms  f  (r)  (there  are  (n-1)!  possible  algorithms 
in  total)  and  computed  the  corresponding  value  of  A  /B  we 

can  search  for  the  maximum  A  /B  .   This  way  we  can  identify 

n'  n  '  ' 

the  optimal  algorithm  f * (r)  and  obtain  the  maximum  value  of 

y(Sh)/y(St). 

In  what  follows  we  carry  this  program  for  the  case  of 

small  n.   For  the  case  of  n  =  1 ,  we  have  P(e)  =  q  which  means 

that  the  decision  made  without  any  memory  can  be  done  with 

probability  of  error  equals  to  q. 

Ratios  A  /B  were  calculated  algebraically  for  n  =  2,  3, 
n   n  b  '  '       ' 

4  and  5.   For  the  value  of  pe(%,l)  it  was  found  that 
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m- 


for  any  £ 


/A3\* 

1^—  I  =1  for  any  f 

/Ac\*     3     2      2  * 

\  5'     p   +  pq   +  q 

The  algebraical  work  to  determine  these  values  is  con- 
tained in  Appendix  A. 

For  values  of  n  from  6  to  10,  several  values  of  p  >  % 
were  chosen.   The  search  for  the  optimal  algorithm  was  per- 
formed by  using  an  IBM  360/67  in  double  precision.   The 
program  found  in  Appendix  B  was  becoming  too  time  consuming 
when  n  exceeds  10.   For  the  values  of  p  in  the  vicinity  of 
1,  value  of  p  at  .99  and  .999  were  used.   However,  in  the 
vicinity  of  h,    the  optimal  algorithm  was  determined  by  a 
Taylor  series  expansion  around  h   +  e ,  and  neglecting  terms 
with  e  having  power  greater  than  or  equal  to  2.   The  deri- 
vation of  this  expansion  was  as  follows: 

A"  (M 

A  (q)  =  A  (h    -    e)  =  A  (%)  -  eA' (h)    +   !,    (-e)2  +  R 

nVH;     nv     '     nK    J  nv  J  21         v  ' 

where  R  is  the  remainder  term. 

Ignoring  the  terms  which  have  en  where  n  >  2 ,  we  get 

kn{h    -e)  =  An('2)  -  eA^Os).   Similarly  B^  +  e)  =  k^ih)    + 

eA' (h) .   Consider 
n 
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A    (q)         A      -    eA' 


B    (p) 

n  Vf  J 

A      + 
n 

eA' 
n 

A' 

1  -  e 

n 
A 

n 

A' 

1  +  e 

n 

A 
n 

To  maximize   X,    we  minimize  A'/A    .      Differentiating  A     with 
respect   to   q  we   get 

Ai+1(q)     =    Icq^1    ♦    Jr(k+1)    qk"*  +  1    A^q)     ♦    jr  [&  \  j  *\  (q),""  * 

which  is  recursive  in  A£  and  A,  ,  where  q  =  %. 

The  program  for  the  search  of  the  ratio  A'/A   is  at 
r      b  n  n 

Appendix  C.   Notice  that  this  approximation  of  p  value  yield 
the  same  result  in  term  of  optimal  algorithm  as  in  the  case 
when  p  =  .51  shown  in  Table  I  and  II. 

These  results  are  summarized  in  Table  I,  Table  II  and 
Figure  5,  6  and  7.   Table  II  presents  the  results  of  Table 
I  in  the  form  used  in  [5]  by  Anderson.   The  result  was  not 
so  much  different  although,  the  algorithm  presented  here  has 
no  stopping  rule.   After  some  amount  of  observation  were 
received  the  decisions  are  made  with  the  same  probability  of 
error  as  in  [5].   Anderson  in  his  paper  made  a  point  that 
the  finite-memory  algorithm  seems  to  operate  in  a  similar 
fashion  as  a  human  decision  making  process.   The  result  of 
this  paper  seems  to  support  his  observation.   The  mechanism 
of  remembering  and  forgetting  seems  to  resemble  somewhat  the 
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procedure  of  making  decision  under  finite  memory  constraint. 
The  fact  that  man  can  gain  experience  and  learn  during  his 
entire  life  span,  this  makes  one  to  believe  that  there  must 
be  some  kind  of  data  summarized  mechanism  such  that  he  can 
make  a  decision  with  some  degree  of  confidence  at  a  given 
moment.   The  word  experience  may  be  one  of  the  key  which 
can  lead  to  further  study  about  the  mechanism  of  data  sum- 
marization.  The  algorithm  proposed  in  this  paper  can  be 
viewed  as  a  data  summarized  procedure  under  simple  hypotheses 
with  Bernoulli  observations.   The  more  complicated  machine 
can  be  developed  by  considering  the  general  case  of  multi- 
ple hypotheses  testing  under  finite  state  memory.   The  idea 
was  discussed  in  [7]  by  Yakowitz  and  in  [9]  by  Salagowicz. 
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Figure  5.   Graph  of  Hn  max  ~  vs.   p  values 
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*  n 


5     6     7     8     9    10    11    12 


Figure  6.   Graph  of  In   max  (A  /B  )  vs.  n  values  (p  from  .50-. 70) 
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p  =  .75 


p  =  .80 


p  =  .85 


Figure  7.   Graph  of  £n  max  (A  /B  )  vs.  n  values  (p  from  .75-. 99) 
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APPENDIX  A 

Determination  of  (A  /B  )*  and  f*  algebraically  for  n=2, 
3,  4,  and  5.   Recall  that  we  have 

Ak+1  -  qk  +  P^r(k+1)Mk"£ 

Bk+1   =   P      +   Mr(k+lAP         ' 

The  case  of  n  =  2,  we  have  the  unique  algorithm  f  =  <1> 
because  the  transition  from  state  2  to  state  1  can  be  done 
only  by  the  jump  of  1  or  r(2)  =  1.   According  to  the  formula 
above  we  have 

A9  =  q  +  pCA-jq0)  =  q  +  p  =  1    where  A..  =  1 


>7  =  p  +  q(B..p0)  =  p  +  q  =  1   where  B-. 


A2 

— -  =  1 

B2 

The  case  of  n  =  3 ,  we  have  two  possible  algorithms  to 
investigate,  namely, 

f3  "  <  LI  > 


and 


f2.  =  <  1,2  >. 


For 


fj  =  <  1,1  >; 

A3  =  q2  +  p(A1q  +  A2q°) 
A2  =  1 
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•  •   A3  =  q2  +  Pq  +  P 

=  q  +  p 

=  1. 
By  the  same  derivation, 

b3  "  p2  +  pq  +  q 

=  p  +  q 

=  1 

1 


C 


"»?)■'• 


For 

f 

3 


f\      =    <    1,2  >; 


A3   -  q2  *  PA2 

2 
=  q   +  p 

and 


But 


B3   =  p2  +  q. 


2         2 
p   +  q  =  q   +  p  for  ^<p=l-q<l 


G)* 


Note  that,  the  superscripts  at  the  ratio  are  for  numbering 
purposes . 


Thus  for  the  case  n  =  3,  the  optimal  algorithm  can  be 
*  =  <  1,1  >  or  f* 


either  f,  =  <  1,1  >  or  f*  =  <  1,2  >. 
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Before  we  proceed  the  proof  for  the  case  n  =  4  and  5,  we 

note  that  the  expression  for  A   and  B   differ  only  in  p  and 

r  n      n  '     r 

q  being  interchanged.   For  this  reason,  the  expressions  for 

B  will  be  presented  without  computation. 

Determination  of  A  /B   and  f*  algebraically  for  n  =  4 

n'  n      n   &         J 

and  n  =  5.   Recall  that 

k     k  k-l 

Vl  =  q  +  Mr(k+1)  M 

Bk-1  =  Pk  +  Mr(k+1)  B!?k*1' 
For  the  case  when  n  =  4,  the  possible  forms  of  algorithm 
are  (n  -  1)  I    =    3  !  =  6,  which  are  as  follows 

fj  =  <  1,1,1  >,  f\   =  <  1,1,2  >,  f3  =  <  1,1,3  > 

f*  =  <  1,2,1  >,  t\   =  <  1,2,2  >,  fj  =  <  1,2,3  >. 

The  optimal  algorithm  among  these  combinations  is  f|  = 
f3  and  (A4/B4)*  =  (A4/B4)3  =  (q3+pV(p3  +  q)  where  superscript 
labels  the  combination  order. 


Proof 


For     f4; 


A4    =    q3    +    p(Aiq2    +    A2q    +    Aj) 
A3    =    q2    +    pCAjq    +    A£) 
A7    =    q    +    pA1 


Al    =    ! 


A2    =    1 


and 
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a3  =  q2  +  pq  +  p  =  q  +  p  =  i 


hence 


3     2 

A4  =  q   +  pq   +  pq  +  p 

2  , 

=  q  (q  +  p)  +  pq  +  p 

=q   +pq+p=l. 
The  same  result  can  be  obtained  from  computing  B. 


■••(» 


=  1 
4- 


It  was  proved  that  the  case  when  n  =  2,  the  transition 
always  goes  to  1.   So  the  value  of  A2  always  equals  to  1  so 


as  B2 


For  fj    =<  1,1,2  >; 

A4  =  q3  +  p(A2q  +  A^ 
A3  =  q2  +  p(A1q  +  A£) 


where 


A1   -   A2    =    1 

=   q      +pq+p=l 
A,    =    q      +    pq    +    p 


B4    =    p      +    pq 

+  q 

( A4 J       _    q3    +    pq 

+   P 

UJ              3 

V    4'            p      +    pq 

+  q 

or    f3        =    <    1,1,3 

>; 

A4    =    q3    +    pA. 
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A3=    1 

.    .      A4   -    q3   +   p 


'  '   \V      p3  ♦  , 


For   fj        =<    1,2,1    >; 

A4    =    q3    +    p(Aiq2    +    A2q    +    A^) 
A3   =    q2    +   pA2 
=    q      +    p 
.     .       A4    =    q      +    2pq      +    p    =    q      +    pq      +   p 


(M      =    q5    *    2pq2    +    p 
\    4*  p      +    zpq       +    q 


For   fjj        =    <    1,2,2    >; 


4 

l3 


A,    =    q3    +    p(A?q    +   A_) 


A,    =    q2    +   P 


hence 

3  2  2  2 

A4=q      +pq+pq      +p      =    q      +p 


'   \V  o2    +   a 


=    1      1 
'4/  p*    +    q 


since 


2  2 

q    +  p  -  p    +  q. 

For      £4  =  <    1,2,3   >; 

A4  =  q3    +    pA3 


51 


A3   =    q2  +   p 

•     •      A4   -   q  +   pq      +   p 

2  „2 

=   q  +  P 


■■$■>■ 


4 

s 

Consider  1  >  p  where  p  >  %. 

^   2 

P  >  P 

1  >  p2  -  p  +  1 

1  >  p2  +  q  =  p2  +  pq  +  q2 
p  -  q  >  (p  -  q)(P2  +  pq  +  q2)  =  p3  -  q3 

q3  +  p  >  p3  +  q 
^^  >  1 

p    +  q 

or  (A4/B4)3  >  1  and  (A^)1,  (A4/B4)5  and  (A4/B4)6  are 
dominated. 

Stated  without  a  proof  is  the  following  simple  fact; 
Let      a  >  0 ;  b  >  0  and  c  >  0 
if     a/b  >  1  then  a/b  >  a  +  c/b  +  c. 

This  result  will  be  applied  when  needed  without  notice 

9 i_p_  >    q +  pq  +  P 

*     '         3  3 

p+q  p+pq+q 


(J})' '  ft)'  ■ 
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Obviously, 

(a5  •  te)' 


since 


q3   +   P   >   q3   +    2pq      +   p 

3  3  2 

p+q        p+2pq    +    q 


where 

2pq   <  2p  q. 

Thus  for  n  =  4,  the  optimal  algorithm  is  fjj  =  <  1,1,3  > 

For  the  case  of  n  =  5 ,  the  possible  forms  of  algorithms 

f  =  <  r(2),  r(3),  r(4),  r(5)  >  are  as  follows: 

f*  =  <  1,1,1,1  >      f2s      =    <    1,1,1,2  > 

fg=<  1,1,1,3  >      £5   =<  1,1,1,4  > 

ff-<  1,1,2,1  >      fr   =<  1,1,2,2  > 


£75    =    <   1,1,2,3  >  fjj   =  <  1,1,2,4  > 

f,.  =  <  1,1,3,1  >  £_   =  <  1,1,3,2  > 

f^1=  <  1,1,3,3  >  f*2  =  <  1,1,3,4  > 

f*3=  <  1,2,1,1  >  f*4  =  <  1,2,1,2  > 

fj.5=  <  1,2,1,3  >  £56  =  <  i*2*1'4  > 

fr   =   <   1,2,2,1  >  fr    =<   1,2,2,2  > 

f*9  =  <  1,2,2,3  >  f20  =  <  1,2,2,4  > 

fj.  =  <  1,2,3,1  >  fr   =  <  1,2,3,2  > 

f23=  <  1,2,3,3  >  f24  =  <  1,2,3,4  >. 
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The  following  algebraical  proof  reveals  that  the  optimal 

o 
algorithm  is  f*  =  fr  =  <  1,1,2,4  >  and  the  maximum  ratio 

T  O  7    J   7  f\  mj 

Ac-/B5  of  this  algorithm  is  (q   +  2p  q  +  p  )np      +  2pq   +  q  \ 
for  all  value  of  pe(%,l). 

Proof.   If  it  is  simpler  to  break  the  possible  f's  into  a 
group  of  4  and  identify  the  maximum  ratio  AJB,.  among  the  4 
and  the  result  of  these  6  groups  will  be  compared  later. 


£-  »  <  1,1,1,1  >;  from  the  case  n  =  4 ,  f .    =  <  1,1,1  >  we 


have 


A^    1,  A2    1,  A j  :   1,  A^    1. 

.  .  A5  =  q4  +  p(Aiq3  +  A2q    +    A3q  +  A4) 

=  q   +  pq3  +  pq2  +  pq  +  P 
=  1 
=  B, 


(» 


1 
=  1 


f2  =  <  1,1,1,2  >; 

A5  =  q4  +  p(A2q2  +  A3q  +  A4) 

=  q4  +  pq2  +  pq  +  p 
4     2 

b5=p  +pq+pq+q 


2 

(tl)      =    Q4  2  PQ2  *   PQ  +  P 
V  Br J  4     2 


f3  =  <  1,1,1,3  >; 
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and 


A5    =    q4    +   p(A3q    +   A4) 
=   q      +   pq   +   p 


4 
B5    =   p      +    pq    +    q 


'         /M      =    q4    *    pq    *   y 
'     '      V  V  r4    +    .a    +    a 


f4   =   <   1,1,1,4   >; 

A5    -    q4    +    pA4 

4 
=   q      +   p 

B5    =    p4    +    q 


t4 

p""    +    q 


Consider  2pq  >  0    \fpe(%,l) 

1    +    2pq    >    1 

i  >   Cp  +  q)2   "   2pq 

,    ^      2  2  2  2,3 

1   >   p      +q      =    p      +pq      +q 

=    p(p    +    q2)    +    q3 

■   P(P2    +    q)    +    q3 
3  3 

■  p    +  pq  +  q 


4  4 

p  -  q 

^      4  4 

-    q    >    p       -    q 


4  4 

q    +  p  >  p    +  q 

4  /AA4 

3 1_P_  =  [  _J. )      >    i 

p4  +  a   vbs; 
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■••(»' *(»' 


and 


and 


q4    +    P    >    Cq4    *   p)    *_   pq  ft*)      >     /M* 

P4    ♦    q  (P4    ♦    q)    +    pq  ^V  >>V 


q    +  p  ^  Cq    +  p)  +  pq    +  pq      •  2^2 

-*2 *-   >    -^-S " *-* "■     since   pq      <   p    q 

p    +  q      (p    +  q)  +  p  q  +  pq 


(»'  •  (»' 


■■■(a 


^— J   is  maximum  in  this  group  of  4 

fc  =  <  1,1,2,1  >;  from  the  case  n  =  4,  the  algorithm  of 

the  form  f-,  =  <  1,1,2  >  we  have  A,  =  A?    =  A.,  =  1  and  A.  = 
3  , 

q    +  pq  +  p. 

A5    =    q4    +    pCAxq3    +   A2q2    +    A3q    +    A4) 

A  T  O  "ZOO 

=    q      +pq      +pq      +pq+pq      +pq+p. 
After   simplifying  we   get 

A5    =    Cp2    +   q)Cpq   +    1) 
B5   =    Cq2    +   P)(pq   +    1) 


/As\        Cp2  +  q)Cpq  +  i)       i  2  ; 

I  r^-j     =   -^-^ ^J  K*^ J-  =    1   since   p      +   q   =   q 

^5'  fqZ    +    p)fpq    +    1) 


5 

Cq"  +  p) (pq  +  i) 

f6s   =    <    1,1,2,2    >    ; 


A  *7 

A5    =    q      +    p(A2q      +    A3q    +    A4) 
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A  *5  T  *J  o 

q      +pq      +pq    +    pq      +pq    +    p 


B5    = 
6 


ft)- 


2  2 

q      +    p    q    +    p 

2  2 

p    +  pq    +  q 


2  2 

q    +  p  q  +  p 

2  2 

p    +  pq    +  q 


f '    =   <    1,1,2,3   >; 


A5    =    qH    +    p(A3q    +    A4) 

4  3     2      2 

=q   +pq+pq   +pq+p 


=  q3  +  p2q  +  p 
B,  -  P3  +  pq2  +  q 


and 


(»'■ 


3     2 

q  +  p  q  +  p 

3      2 

p  +  pq  +  q 


<  1,1,2,4  >; 


A5  =  q   +  p(A4) 

4      3     2      2 

-  q  +  pq  +  p  q  +  p 


3     2  2 

=  q  +  P  q  +  P 

3      2  ? 

•  B,.  =  p"5  +  pq^  +  q^ 


ft) 


3     2      2 

q  +  p  q  +  p 

3  x    2  .   2 

p  +  pq  +  q 


3-2      3 

q    +  2p  q  +  p 

3  ^  ,   2  A   3  • 

P   +  2pq  +  q 
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o 

Among  5,6,  7  and  8,  the  ratio  (A5/B_)   is  the  maximum, 
Consider  2pq  >  0 


.     .      p(l    -   p)    +    q(l    -    q)    >    0 

2  2  2  2 

.     .       p-p      +q-q      >    0<=£p    +    q    >    p      +    q 

\. 

2  2 

..       p+q+pq>p      +q      +pq 

(p  +  q)  (p  -  q)  +  pqCp  -  q)  >  Cp  -  q)(p2  +  q2  +  pq) 

2  2  2  2    „      3  3 

p       -q      +pq-pq      >p       -q 


8  /(    v5 

3  2  2  3  2  2. 

q    +pq  +  p    >p    +pq    +q=» 

'5/  N"5 


ft)1  •  ■  ■  ft)s 


Consider 
.6 


/ACV  2  2,  3  2,2  2 

(_5)     =9 +Pq    +    P_q      +pq+P      +pq      +pq 

VBc/  2  2    .  3    .         2    ^      2    ^      2      ^ 

N    5'  p      +pq      +q        p      +pq      +q      +   p   q   +   pq 

3  2  2 

<  £ +  p  q  +  P 

3  2  2 

p    +  pq    +  q 

Since  8         ..   ,6 


pq3  +  pq  <  p3q  + 

'5'  \"S> 


«*ft)  >  ft)  • 


From 


/AA7  3    ,       2 

[_i )   =  a +  p  q  +  p 

\V  p3    +    pq2    +    q 

rewritten   as 


/M      m    q5    +    2p2q    *    p5    ♦    pq 

VBc/  3         ,       2  3 

\    5'  d      +    zdq      +    a      +    ua 


2pq      +    q      +    pq 
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q5  +  2p  q  +  p5  „  XS +  2p  q  +  p5)  +  pq 

p3  +  2pq2  +  q3    (p3  +  2pq2  +  q3)  +  pq 


ft)'  •  ft)'  • 


g 
£-  =  <  1,1,3,1  >;  for  the  case  n  =  4,  the  form  f.  = 

3 
<  1,1,3  >  we  have  A.,  =  A2  =  1,  A,  =  3  and  A.  =  q   +  p. 

.     .       A5    =    q4    +    p(Axq3    +    A2q2    +    A3q    +    A4) 

4,3,2  32 

=    q      +    pq      +    pq      +    pq    +    pq      +    p    . 

After   simplifying  we   get 

3  2 

A5    =   pq      +   p      +    q 

D  3,2 

••       B5=pq+q      +    p 

3  2 

=    p    q    +    p      +    q 


••(» 


pq+p+q^n       .  3^3 


-4 *-«- *  <    1    since   pq      <   p   q. 

5'  p    q    +   p      +    q 

f*°    =    <    1,1,3,2    >; 

A5    =    q4    +    p(A2q2    +    A3q    +    A4) 
4  2  3  2 

■  q    +  pq    +  pq  +  pq    +  p 

3  2  2 

■  q    +  pq    +  pq  +  p 

2 

■  q    +  p 

2 
.     .    B5      =    p      +    q 

*■  2 

-  q    +  p 


/A  \10 

■•(a  =i 
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f*1  =  <  1,1,3,3  >  ; 

A5  =  q4  +  p(A3q  +  A4) 

4  3     2 

=  q   +  pq  +  pq   +  p 

3  2 

=  q   +  pq  +  p 

3 

=  q  +  p 

.".   B5  =  p3  +  q 


ft) 


11     3  + 

=  q  +  p 
'         p3  +  q 


f\2   =   <   1,1,3,4  >; 

A5  -  q4  +  pA4 

4      3     2 
=  q4  +  pq"5  +  pZ 

3     2 

=  q  +  p 


B,  =  vJ  +   q' 


•■(» 


12    „3    „2 

3  A   2 
p   +  q 


12 
This  group  from  9,  10,  11  and  12  the  ratio  (Ar/Bj.)    is 


maximum.   Clearly 
10 

I 


ft)" --ft)'- 


Since 


^   2 

P  >  P 

1  >  p2  +  q 
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p    +    q    >    p2    +    pq    +    q2 

(p  +  q)(p  -  q)  >  Cp  -  q)(p2  +  pq  +  q2) 

2  2    %      3  3 

p    -  q    >  p    -  q 

3  2  3  2 

q    +  p    >  p    +  q 


^•-(»  >ft) 


and 


ft) 


,r   .             „3   ,                 3,2  32 

_i.l     =  q    +  p  =  q    +  p    +  pq  <    q    +  p 

r  I                3                     32  32 

5/          P+q      P+q+pq  p+q 

12         ,A    xll 


ft)  •  ft) 


13 

fft   =  <  1,2,1,1  >;  from  the  case  n  =  4,  the  form  of 

2 
f,  =  <  1,2,1  >  we  have  A-  =  A~  =  1,  A-  =  q   +  p  and  A.  = 

2     2        2         2 
q   +pq   +p=p   +q+pq 

.     .       A5    =    q4    +    p(Aiq3    +    A2q2    +    A3q    +   A4) 

4  3  ?  32  322 

=    q      +    pqJ    +    pq      +    pq      +    p    q    +    p      +    p    q      +    pq 

After  simplifying  we  have 

A5    =    pq2    +    q      +   P 

2  2 

..       B5    =    pq    +    p      +q 

2  2 

=    p    q    +    q      +    p 


ft) 


13  2  2 


pq"    +q+P^i       •  2^2 

=   iftJ -^s 1-  <    1    since    pq      <   p    q. 

p    q    +    q      +    p 
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•  14 


=    <    1,2,1,2    >; 

A5    =    q4    +    p(A2q2    +    A3q    +   A4) 

4  2  3  2  3  2    2 

=    q      +pq      +pq      +pq    +    p      +pq      +    pq 

3  2  2  7    2 

=    q      +pq      +p      +pq   +   pV 

2  2    2 

=   q      +   p    +   p   q 


-2  2    2 

B5    =   p  +    q    +    pV 

2  2    2 

=    q  +    p    +    p    q 


ft) 


14 


=    1. 


.15 


=    <    1,2,1,3    > 


A5    =    q      +    p(A3q    +   A4) 

4  3  2  3  7    2 

=  q    +  pq    +  p  q  +  p    +  pV  +  pq 

3  2  2    2 

=    q      +p      +pq      +pq 


■ft) 


B5 
15 


3  ,                 2    2 

q  +  p  +  p  q 

3  2    2 

p  +  q  +  p  q 


3  ,22 

q  +  p  +  p  q 

3  2    2 

P  +  q  +  P  q 


.16 


<    1,2,1,4    >; 


A5    =    q"    +   p(A4) 

4  3  2    2 

=q      +p      +pq      +pq 

4  7  7     7 

■  q    +  p(p    +  q)  +  p  q 

4  2  2    2  2  2 

=q      +pq      +pq      +p      since   p      +    q 
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B5    ■    P4    +    p2q    +    P2q2    +    q2 


ft) 


4              2  2    2            2 

=  q    +  pq +  p  q    +  p 

4^2  22^2 

p    +pq  +  pq    +q 


This  group  from  13,  14,  15  and  16,  the  ratio  (A5/B5) 16 

14  13 

is  the  maximum.   Clearly  (A^/Bp)    >  (A^/B,.) 

Consider 

1  >  1  -  pq 

2  2  2 

l  >  (p  +  q)  -  pq  =  p  +  pq  +  q 

p+q>p2+pq+q 

p2  -  q2  >  (p  -  q)  (p2  +  q2)  +  pq(p  -  q) 

p2    -    q2    >    (p    -    q)  (p    +    q)(p2    +    q2)    +    pq(p    -    q) 
2  2  4  4  2  2 

p    -q    >p     -q    +pq-pq 

4  2  2  4  2  2 

q    +pq    +p    >p    +pq  +  q 


and 


4A      2J2J2    2    x      4    .       2  2  2    2 

q    +pq    +p    +p   q      >   p      +   p   q   +   q      +   p    q 


4  2,2,22 

9 uzs l_e +  p  q    >  isAl 

4   x      2      x      2   x      2    2' l 

p    +pq  +  q    +pq 


•ft)"  •  ft)' 


ft) 


!  ''  3  2    2 

-  q  +  p  +  p  q 

3  I         ,.2    2 
p  +    q    +    p    q 

4  3  2  2    2 

=  q  +pq    +p    +pq  +  pg 

4  3  2  2    2 

p  +pq+q    +pq+pq 

4  3  2,2  2  22 

B    q  +pq       +    P       +pq       +    p    q    +    p    q 

4x3i2i2i         2    ^      2    2 

p  +pq+q  +pq+pq  +pq 

A  O         O         O   1  *") 

=  a +  pq  +  p  +  p  q  +  pq(q  +  p) 

4     2      2     2  2        2 

p  +pq  +  q  +pq  +  pq(p  +  q) 
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<    q4    ♦    PQ2    *    P2    +   P2^2 
4  2  111 

p    +  p  q  +  q    +  p  q 

2  2 

since   pq(q      +   p)    =    pq(p      +   q)  .       Thus 

16        ,  .   v  15 


ft)  -ft)  ■ 


17 
£5   =  <  1,2,2,1  >;  from  the  case  n  =  4  the  form  f.  = 

<  1,2,2  >  tells  us  that  A-  =  A2  =  1,  A_  =  q  +  p  and  A. 

q2  +  p. 

.     .       A5    =    q4    +   p(A1q3    +    A2q2    +    A3q    +    A4) 

4  3  2  3  2  2  2 

=   q      +    pq      +    pq      +   pq      +    p    q    +    pq      +    pc 

2  3  2  2  2 

=  q     +  pq    +  P  q  +  pq     +  P 

3  2  2 

=    pq      +    p      +    q      +   pq 

3  2 

=    pq      +    p      +    q 

••     Bc=pq+q    +P 


5 


3  2 

=    p    q    +    p      +    q 


17  3     ,       2 


/A5\           pq"    +    p"    +    q    „    -,       •  3^3 

/  g-1      =   ^ —^2 *  <    1    since   pq      <   p   q 


p  q 


f*8    =    <    1,2,2,2    >; 

A5    =    q      +    p(A2q      +    A3q    +    A4) 

4  2  3?  ?  2 

=  q    +  pq    +  pq    +  p  q  +  pq    +  p 

3  2  2 

=    q      +    pq       +    pq(p    +    q)    +    p 

2  2 

=    q      +    pq    +    p^ 
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=  p 


.  .  B5   =  q^  +  p 


■••<»" 


=  P 


=  1 


.19 


=  <  1,2,2,3  >; 


A5  =  q4  +  p(A3q  +  A4) 


=  q4  +  pq3  +  p2q  +  pq2  +  p2 

3  2       2     2 

=  q  +  p  q  +  pq  +  p 

2  2      2 

■  q    +  p  q  +  p 

=  q(q  +  p2)  +  p2 

■  q(q2  +  p)  +  p2 

=  q   +  pq  +  p2 


.  .  B 


( 


5 
19 


3 
p-  +  q 


q   +  p 
3 


M  \  q^£ 


'  W 


f20  =  <  1,2,2,4  >; 


A5  =  q   +  p(A4) 

4      2     2 

■  q    +  pq    +  p 

4     2      2 
.  .  B5  -  ]?   +pq  +  q 

'  '  \h)  p4  *  P2q  ♦  q2 
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4  2  2  3 

=    q*    +   pq      *   p    q    +    p° 

4  2  2  3 

p    +pq+pq    +q 

4  3 

=  q    +  pq  +  p 

4  3    ' 

p     +  pq  +  q 

In  this  group  from  17,  18,  19  and  20,  the  ratio  (A5/B  )20 
is  the  maximum.   Clearly  (A5/B  )18  =  1  >  (A5/B  )17. 
Consider 

pq    >    0 

2  2  2  2 

p    +pq  +  q    >p    +q 
(p  -  q)Cp2  +  pq  +  q2)  >  (p  -  q)(p2  +  q2) 

p3  -  q3  >  (p  -  q)(p  +  q)(p2  +  q2) 
=  (p2  -  q2)(P2  +  q2) 


=  P4  ■ 

V 

3     3       ^   4 
p   -  q   +  p.q  >  p 

4 
-  q   +  pq 

q4  +  pq  +  p3  >  i 

4          3    L 

p  +  pq  +  q 

20  ,,    xl8 


Consider 


m  ■  (» 


/A    \19 

ft) 


q 

+ 

p 

3 

p 

+ 

q 

3 

q 

+ 

2 

pA 

+ 

pq 

p3 

+ 

q2 

+ 

pq 

4- 

q 

+ 

3 

pq 

+  p2q 

+  P'  + 

pq 

4 
P 

q4 

+ 
+ 

3 

p  q 

pq  " 

+  pq2 

*q3* 

t    2 

pq(q 

pq 

-  p) 

p4    +    pq    +    q3    +    pq(p2    +    q) 
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4         3 

<  q  +  pq  +  p 

4  I     7   3 

p  +  pq  +  q 


tef  •  CD" 


,».< 


1,2,3,1  >;  the  form  f.  =  <  1,2,3  >  having  A.  = 


A2  =  1;  A3  =  q2  +  p  and  A4  =  q2  +  p2. 


A5  =  q4  +  p(Aiq3  +  A2q2  +  A3q  +  A4) 


4  ♦  pq3  ♦ 


=  q'  +  pq-  +  pq   +  pq   +  p  q  +  pq   +  p" 

3  3      2     3 

=  q   +  pq  +  pq   +  pq   +  p 


b5    =  p3  +  pq  +  p  q  +  pzq 


2      3 

-    +  q 


.  /k,\  3  3,3 

(_A)   =  q  +  pg  +  p  +  pq  + 

\B5/      p3  +  pq  +  q3  +  p3q  + 

3  2  3  2 

iince   pq      +pq      <pq+pq. 


3  .    2 

£§-  <  1 

p  q 


f22  =  <  1,2,3,2  >; 

A5  =  q4  +  p(A2q2  + 


A3q  +  A4) 


=  q4  +  pq2  +  pq3  +  p2q  +  pq2  +  p3 

3      2  3 

=  q  +  pq  +  pq  +  p 

B5  =  P3  +  P2q  +  pq  +  q3 


22 


^  +  Pq  +  P*  +  Pq2  <  1  since  pq2  <  p2q. 
P  +  pq  +  q  +  P  q 


-23 
■5 


(:■'-<  1,2,3,3  >  ; 


A5  =  q4  +  p(A3q  +  A4) 
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■  ■  (® 


4     3    2      2     3 
=  q   +pq   +pq  +  pq   +p 

3  3 

=  q  +  pq  +  p 

B5  =  P3  +  pq  +  q3 
23 


=  1 


f24  =  <  1,2,3,4  >; 


4  2  3 

A5  =  q4  +  pqZ  +  p^ 

=  q2  (q2  +  P)  +  P3 

m    q2  (p2  +  q)  *  p3 

3  2  2     3 

■  q  +  p  q  +  p 

7  7  9      7 

Be  =  P-5  +  pV  +  P^ 


•••ft) 


24 

=  1. 


This  group  from  21  to  24  are  dominated  by  the  maximum 
ratio  from  the  other  groups. 


We 


have  left  to  identify  the  maximum  ratio  among  (Ar/B.) 
(A5/B5)8,  (A5/B5)12,  (A5/B5)16,  and  (A5/B5)20. 

First  let  us  prove  the  algebraical  fact  that  if  a  >  0, 
b  >  0,  c  >  0,  and  d  >  0  then  for  a  >  b  and  c  >  d ,  we  have 

a  +  c    a  +  d  m 

and 

b  *  E-T-d  l£  b  <  d  •  (2) 

68 


4 


5 


Suppose 


a  +  c^a  +  d   ,       _  , 

t— — —  <  u  .  J--  where  c  >  d 
b  +  c   b  +  d 


.  .  (a  +  c)(b  +  d)  >  (a  +  d)  (b  +  c) 

ab  +  ad  +  be  +  cd  >  ab  +  ac  +  bd  +  cd 
d(a  -  b)  >  cCa  -  b) 
and 

(a  -  b)  >  0 

.  .  d  >  c   contradiction. 

.  .  if  a  >  b  and  c  >  d  then  a+c/b+c  <  a+d/b+d  proved  (1) . 
From  (2)  suppose  a/b  >  (a+c)/(p  +  d)  where  a/b  <  c/d  .  .  ab  +  ad 
>  ab  +  be.   a/b  >  c/d  contradiction.    .  .  if  a/b  <  c/d  then 
a/b  <  (a+c)/(b  +  d)  proved  (2). 
Consider  the  ratio 
16 


(® 


4      2     2  2     2 
=  q   +  pq   +  p  q   +  p 

4  A   2   7   22,.   2 


:  '      p  +  p  q  +  p  q' 

4  2     2  2     2  3 

=  q  +  pq    +  p  q    +  o  +  p 

4  2  2  2      9     3 

p  +  p  q  +  p  q  +  pq  +  q 

=  q4  +  pq(p  +  q)  +    p5  +  p2q2 

p4  +  pq(p  +  q)  +  q3  +  p2q2 

4  3     2  2 

=  9 +  pq  +  p +  p  q 

4  3     2  2 

p  +  pq  +  q  +  p  q 

4  I  x   3     \Bc/ 

p  +  pq  +  q      >  5' 


< 
P 

which  is  greater  than  1 
Consider  the  ratio 
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( 


» 

4 

_  q  +  p 

4  ^ 
p   +  q 

=  q4  +  pq  +  p2 
p4  +  pq  +  q2 

4          3 

-  q    +  pq  +  p 

+  p2q 

4          3 

p  +  pq  +  q 

+  pq 

4       .3 
.  q  +  pq  +  P 

p4  +  pq  +  q3 

■ft 

20 


Since  by  the  result  of  (2)  and 
2         4         3 

p_q.  =  e  >  q    +  pq  +  p 
pq   q   p  +  pq  +  q3 

shown  as  follows:   since 

4     3      2  2      3     4 

p  +  p  q  +  p  q  +  pq  +  q  >  o 

*  A  7  *1   O  *7  A 

.     .       (p   +  p  q  +  p  q   +  pq"5  +  q  )  +  pq  >  pq 


(p-q)  (p4+p3q+p2q2+pq3+q4)  +  pq(p-q)  >  pq (P  +  q) (p  -  q) 


5     5     2       2^,2     2^ 

p  -  q  +  p  q  -  pq  >  pq(p  -  q  ) 

5     2      3     3   5    2 

p  +  p  q  >  p  q-pq  +q  +pq 

5     2       3     5      2     3 

p  +  p  q  +  pq  >  q  +  pq  +  p  q 
p(p4  +  pq  +  q3)  >  q(q4  +  pq  +  p3) 

4         3 

2.  >  q  +  pq  +  p 

q   4  A   .  3 

M    p   +  pq  +  q 


20    ,,  v 16 


©  •  (8  •  (» 


Consider 

the 

ratio 

(9* 

_H3 

+  p  q 

+  p2 

3 
P 

2 
+  pqZ 

+  q2 
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3     2,2 
3    "  2    ~ 2 

p  +  q  +  pq 

3     2 

>  q    +  p 

3  .   2  ' 
p   +  q 

Since  by  the  fact  of  (2)  and 

2  3  2 

E— S-  =   P-  >   9-= P— -     shown   as    follows:      we   have 

L        q  3  2 

pq       M      p    +  q 

3  3 

p°    +    q-5    >    0 

p2(l    -    q)    +    q2(l    -    p)    >    0 

2  2  2  2 

pZ    -    pZq    +    qZ    -    pqZ    >    0 

p    +  q    >  p  q  +  pq    =  pq(p  +  q)  =  pq 
(p2  +  q2)(p  +  q)  >  pq 
(p2  +  q2)  (p  +  q)  (p  -  q)  >  pq(p  -  q)  ;  p  >  q 

4  4  2  2 

p    -  q    >  p  q  -  pq 

4  2  4  2 

p    +  pq    >  q    +  p  q 

P(P3   +   q2)    >   q(q3   +  P2) 

3    ,       2 
£  >  9 +   P 

q        3  7     2 

M         p      +    q 


■■ftW 


Rewritten 


( 


S 


8 


^q4    * 

3           2              2 

pq    +  p  q  +  p 

P4    ♦ 
-q4^ 

3                2           2 
p    q    +    pq       +    q 

pq(q    +  p)  +  p 

p4* 

-q4< 

pq(p2    +    q)     +    q2 
p2    +    pq(q2    +    p) 

4  2  2 

p    +  q    +  pq(q    +  p) 
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Consider   that 

1   >   p 

P   >   P2 
1   +   p   >    1   +   p 

1   >   1    -   p   +   p2 

1    >    1    -    2p    +   p2    +   p 

1    >    q2    +    p 

pq  >  pq(q    +  p) . 

By  using   the    fact    from    (1)    and   the    result   shown   above  we   have 

q4    +    p2    ;    Pq(q2    +    p)         q4    +    p2    +    pq    =    q4    ♦    p    =    /M 

4  2  2  4 2  4  \B    / 

p      +    q      +    pq(q      +p)         P+q+pq        P+q         v5' 


■■(»*>&)' 


o 

Thus  (Ar/Bp)   is  the  maximum  ratio  of  the  case  n  =  5 

o 

and  this  means  fr  =  <  1,1,2,4  >  =  f*  is  the  optimal  algorithm 
and  we  have  completed  the  proof. 
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APPENDIX  B 

//BGCC2894  JOB  I  1205, 0530 ,R0X2) » ■ CHODCHCEY  SMC  2899' 

//  EXEC  FORTCLG, REGION. G0=100K 

//FGRT.SYSIN  DO  * 

C 

C 

C 

C  THIS  PROGRAM  CONSISTS  OF  TWO  PARTS,  ONE  IS 

C  THE  MAIN  PART  WHICH  DETERMINING  THE  POSS 

C  -IBLE  ALGOKITHM  FOR  A  SPECIFIC  VALUE  OF  N 

C  FOR  EACH  OF  THIS  ALGORITHM,  THE  VALUE  OF 

C  A(N)/B(N)/  IS  COMPUTED  BY  PART  TWO  OF  THE 

C  PROGRAM  WHICH  IS  THE  SUBROUTINE  OF  THIS 

C  PROGRAM.  FOR  THE  VALUE  OF  N=5,6,7  AND  8 

C  THE  PROGRAM  WORKS  OUT  SATI S  F  ATORRI  LY   - 

C  WITH  TIME.  BUT  FOR  N=9  AND  10  THE  PROGRAM 

C  IS  TIME  CONSUMING. 

C  IN  ORDER  TO  SOLVE  OUT  THIS 

C  PROBLEM  ,  WE  NEED  TO  ELIMINATE  SOME 

C  ALGORITHMS  WHICH  SEEMED  NOT  TO  FIT  THE 

C  PATTERNS  OF  THE  OPTIMAL  ALGORITHM  FOR 

C  THE  CASE  OF  N  SMALLER  THAN  10,  SINCE  ALL 

C  THE  SOLUTIONS  SEEM  TO  HAVE  DEFINITE  PATT 

C  -ERN  OF  OPTIMALITY  ON  A  GIVEN  VALUE  OF  P. 

C  NOTE  THAT  THE  PROGRAM  DOWN  BELOW  CAN  BE 

C  USED  FOR  THE  CASE  OF  N=10  ONLY,  IF  THE 

C  CASE  OF  SMALLER  N  WILL  BE  RUN  THEN  SOME 

C  CHANGES  MUST  BE  MADE  AS  FOLLOW; 

C 

C  1.  ELIMINATE  STATEMENTS  DO  10  IM=1,M-1 

C 

C  AND   R(M)=IM  FROM  THE  MAIN  PROGRAM  FOR 

C 

C  ALL  M=N+l,N+2, . . . , 10 

C       2.  CHANGE  STATEMENT   TRIM= A( 10 ) /8< 1G) 

C 

C       TO    TRIM=A(N)/3(N) 

C 

C       3.  AT  SUBROUTINE  PART,  CHANGE  STATEMENT 

C 

C       DO  10  1=3,10  TO   DO  10  1=3, N 

C 

c 
c 

REAL*8  A(12) , B<12) , SMAX , TRI M, P , Y ( 12 J 
INTEGEk*4  R(12),G(12) 
READ  (5,1000)Y 
1000  FORMAT  (12F4.3) 
DO  100  K3=l,12 
P=Y(K3) 
DO  6  M=l, 12 
R(M)=0 
6  G(M)=0 

SMAX=0.0D0 

R(2)=l 

DO  10  13=1,2 

R(3)=I3 

DO  10  14=1,3 

R  ( 4  )  =  I  4 

DC  10  15=1,4 

R(5)=I5 

DO  10  16=1,5 

R(6)=I6 

DO  10  17=1,6 

R(7)=I7 

DO  10  18=1,7 

R(6)=I8 

DC  10  19=1,8 

R(9)=I9 

DC  10  I  10=1,9 

R(10)  =  I  10 

CALL  CALCAN  (A,B,R,P) 
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TRIM=A( 10)/B(10) 
IFITRIM.LE.SMAX)  GO  TO  10 
SMAX=TRIM 
DO  3  J=2,12 
8  G(J)=R(J) 
10  CONTINUE 

WPITE(6,1001)P 
WRITE  (6,3000)  SMAX 
WRITE  (6,3001)(G(J)fJ=2,12J 
1001  FORMAT  (•  • ,T10, • PROBABILITY 

3000  FORMAT  (•  NTIO,'  A(N)/B(N)  =  «,F16.12) 

3001  FORMAT  {•  «,T10,«  ALGORITHM  ='//('  ,illI6),I0»J 
100  CONTINUE 

STOP 

END 
C 

C     SUBROUTINE 
C 

SUBROUTINE  CALCANi A,B , R* P) 

REAL*8    A(12),B(12) 

INTEGER**    R(12J 

Q=1.0D0-P 

A(1)=1.0D0 

B(1)=1.0D0 

A(2)=1.0DO 

8(2)=1.QD0 

DO    10    1=3,10 

X1=0.0D0 

X2=0.0D0 

L1=R(I) 

L2=I-1 

DO    20    L=L1,12 

X1=X1+P*A(L)*(Q**(L2-L) ) 
20    X2  =  X2  +  Q*B(L)=MP**(L2-L)  ) 

A(I  )  =  q**(  I-D+Xl 

B(I)  =  P**U-1)+X2 
10    CONTINUE 

RETURN 

END 
//GO.SYSIN  DD  * 
C 

C     INPUT  DATA 
C 

.51  .55  .60  .65  .70  .75  .80  .85  .90  .95  .99  .999 
C 
C 

C     FOR  CASE  OF  N=10t  THE  PROGRAM  ABOVE  WILL 
C     GIVE  THE  OUTPUT  FOR  EACH  P  IN  THE  FORM 
C       PROBABILITY  P  =  ... 
C       A(N)/B(N)      =  ... 
C       ALGORITHM      =  ... 
C 
C 

c 

C  UNFORTUNATELY,  THE  PROGRAM  TAKES  0.5  HRS. 

C  FOR  EACH  INPUT  P,  THAT  MEANS  WE  HAVE  TO 

C  USE  7.5  HRS  ALL  TOGETHER.  THE  PROGRAM 

C  BELOW  ILLUSTRATED  THE  WAY  TO  DECREASE  THE 

C  RUN  TIME  BY  ELIMINATING  SOME  ALGORITHMS 

C  WHICH  SHOULD  NOT  BE  EVALUATED  BECAUSE  OF 

C  THE  UNCORRELATED  PATTERN  TO  THE  OPTIMAL 

C  ALGORITHM  OBTAINED  IN  THE  CASE  OF  N   - 

C  SMALLER  THAN  10.  THE  RESULT  WAS  OBTAINED 

C  IN  ABOUT  7  SECONDS  OF  CPU.  TIKE  ON  ONE 

C  VALUE  OF  P  AND  WAS  CHECKED  OUT  CORRECTLY 

C  WITH   THE  CASE  OF  FULL  COMBINATIONS  WAS 

C  USED  FOR  THE  SAME  P. 


C 

c 


REAL*8  A( 12) ,B( 12) , SMAX, TRIM, P 
INTEGERS  R(l2),G(12) 
READ  (5,1000)Y 
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1000  FORMAT  (12F4.3) 
DC  6  M=l,12 
R(M)=0 

6  G(M)=0 

P=0.51D0 

SMAX=0.0D0 

R(2)=l 

R(3)=l 

R(4)=l 

DO  10  15=1,4 

R(5)=I5 

DO  10  16=1,5 

R(6)=I6 

00  10  I7=3t6 

R(7)=I7 

DO  10  18=4,7 

R(6)=I8 

DO  10  19=6,8 

R(9)=I9 

R(10)=9 

CALL  CALCAN  (A,8,R,P) 

TRIM=A( 10)/8(10) 

IF(TRIM.LE.SMAX)  GO  TO  10 

SMAX=TRIM 

DO  8  J=2,12 
8  G(JJ=R( J) 
10  CONTINUE 

WRITE(6,1001)P 

WRITE  (6,3000)  SMAX 

WRITE  (6,3001) (G{ J)  ,J=2?12) 

1001  FORMAT  (•  ' ,T10, 'PROBABILITY  P  =«,F6.3J 

3000  FORMAT  (•  ',T10,'  A(N)/6(N1  =  ',F16.12) 

3001  FORMAT  (•  '.TIO,'  ALGORITHM  =•//('  ,tllI6)f,0,J 
100  CONTINUE 

STOP 

END 
C 

C     SUBROUTINE 
C 

SUBROUTINE  C ALCAN ( A , B , R , P ) 

REAL*8  A(12),B(12) 

INTEGER*4  R(12) 

Q=1.0D0-P 

A(l)=1.000 

B(1)=1.0DO 

A(2)=1.0D0 

B(2)=1.0D0 

A(3)=1.0D0 

B(3)=1.0D0 

A(4)=1.0D0 

B(4)=1.0D0 

DO  10  1=5,10 

X1=0.0D0 

X2=0.0D0 

L1=R(I) 

L2=I-1 

DO  20  L=L1,L2 

X1=X1*P*A(L)*(Q**(L2-L) ) 
20  X2  =  X2  +  Q*B(L)*(P**(L2-D) 

Ad  J  =  Q**<  I-li+Xl 

B(I)=P**(  I-1J+X2 
10  CONTINUE 

RETURN 

END 
C 

c 

C  RESULT  OF  THIS  RUN  IS  ... 

C       PROBABILITY  P  =  0.51 

C       MN)/B(N)  =  1.1*196 

C  ALGORITHM    =111345789 

C 

c 
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APPENDIX  C 

//BCCC2894  JOB  ( 1205, 0530, R0X2 ),' CHODCHOEY  SMC  2399* 
C 

c 

C  THIS  PROGRAM  IS  USED  IN  SEARCHING  FOR 

C  THE  OPTIMAL  ALGORITHM  AT  THE  VALUE  OF  P 

C  EQUALS  .50+S  FOR  N=6,7,8,9  AND  10. 
C  THE  PROGRAM  WAS  PRESENTED  IN  THE 

C  FORM  WHICH  CAN  BE  USED  ONLY  FOR  THE  CASE 

C  OF  N=10,  FOR  THE  CASE  OF  SMALLER  N  WE 

C  CAN  PREPARE  THE  PROGRAM  AS  STATED  IN   - 

C  APPENDIX  fi 

c 

C 
C 

REAL*8  A(10) ,B(10) ,SMAX,TRIM,P 

INTEGERS  R(10)  ,G(10) 

P=0.50D0 

DO  6  M=l,10 

R(M)=0 
6  G(M)  =  0 

SMAX=O.ODO 

R<2)=1 

DO  10  13=1,2 

R(3)=I3 

DO  10  14=1,3 

R(4)=I4 

DO  10  15=1,4 

R(5)=I5 

DO  10  16=1,5 

R(6)=I6 

DO  10  17=1,6 

R(7)=I7 

DO  10  18=1,7 

R(8)=I8 

DO  10  19=1,8 

R(9)=I9 

DO  10  110=1,9 

R(10J=I 10 

CALL  CALCAN  (A,B,R,P) 

TRIM=B(  10J/AC10) 

IF(TRIM.GE.SMAX)  GO  TO  10 

SMAX=TRIM 

DO  8  J=2,12 
8  G(J)=R( J) 
10  CONTINUE 

WRITE(6,1001)P 

WRITE  (6,3000)  SMAX 

WRITE  ( 6,3001) (G( J  )  ,J=2,10) 
1001  FORMAT  (•  • ,T10, ' PROBABILITY  P  =',F6.3) 

3000  FORMAT  (•  STIO,'  B(N)/A(N)  =  ',F16.12) 

3001  FORMAT  (»  »,T10,«  ALGORITHM  =•//(•  ',1116),  '0') 
STOP 

END 
C 
C 

C     SUBROUTINE 
C 
C 


SUBROUTINE  CAL CAN ( A  ,  B , R , P ) 

REALMS  AtlO)tb(lO) 

INTEGERS  e(10) 

Q=1.GD0-P 

A(1)=1.0D0 

B(1)=0. ODO 

A(2)=1.0D0 

B(2)=0.0D0 

A(3)=1.0D0 

B(3)  =  0.0D0 

DC  10  1=3,10 

X1=0.000 

X2=O.0D0 
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X(3)  =  0.0D0 

ll=R(I) 

L2=I-1 

DC    20    L=L1,L2 

X1=X1+P*A(L)*(Q**(12-L) J 

X2  =  X2+(  (L2-U-1)*A(L)*(P**(L2-L)  ) 
20    X3=X3+B(L)*(P*#( (L2-LJ+1) ) 

A(I  )  =  Q**(  I-D+Xl 

B(I)  =  U-1)*P**( I-2J+X2+X3 
10    CONTINUE 

RETURN 

END 
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